Sharp Oracle Inequalities in Low Rank Estimation
نویسنده
چکیده
The paper deals with the problem of penalized empirical risk minimization over a convex set of linear functionals on the space of Hermitian matrices with convex loss and nuclear norm penalty. Such penalization is often used in low rank matrix recovery in the cases when the target function can be well approximated by a linear functional generated by a Hermitian matrix of relatively small rank (comparing with the size of the matrix). Our goal is to prove sharp low rank oracle inequalities that involve the excess risk (the approximation error) with constant equal to one and the random error term with correct dependence on the rank of the oracle. 1 Main Result Let (X,Y ) be a couple, whereX is a random variable in the space Hm ofm×m Hermitian matrices and Y is a random response variable with values in a Borel subset T ⊂ R. Let P be the distribution of (X,Y ) and let Π denote the marginal distribution of X. The goal is to predict Y based on an observation of X. More precisely, let l : T × R 7→ R+ be a measurable loss function. We will assume in what follows that, for all y ∈ T, l(y; ·) is convex. Given a measurable function f : Hm 7→ R (a “prediction rule”), denote (l • f)(x, y) := l(y; f(x)) and define the risk of f as P (l • f) = El(Y ; f(X)). Partially supported by NSF Grants DMS-1207808, DMS-0906880 and CCF-0808863
منابع مشابه
Adaptive Estimation of and Oracle Inequalities for Probability Densities and Characteristic Functions
The theory of adaptive estimation and oracle inequalities for the case of Gaussian-shift–finite-interval experiments has made significant progress in recent years. In particular, sharp-minimax adaptive estimators and exact exponential-type oracle inequalities have been suggested for a vast set of functions including analytic and Sobolev with any positive index as well as for Efromovich–Pinsker ...
متن کاملVon Neumann Entropy Penalization and Low Rank Matrix Estimation
We study a problem of estimation of a Hermitian nonnegatively definite matrix ρ of unit trace (for instance, a density matrix of a quantum system) based on n i.i.d. measurements (X1, Y1), . . . , (Xn, Yn), where Yj = tr(ρXj) + ξj , j = 1, . . . , n, {Xj} being random i.i.d. Hermitian matrices and {ξj} being i.i.d. random variables with E(ξj |Xj) = 0. The estimator ρ̂ := argminS∈S [ n−1 n ∑ j=1 (...
متن کاملOptimal estimation of low rank density matrices
The density matrices are positively semi-definite Hermitian matrices of unit trace that describe the state of a quantum system. The goal of the paper is to develop minimax lower bounds on error rates of estimation of low rank density matrices in trace regression models used in quantum state tomography (in particular, in the case of Pauli measurements) with explicit dependence of the bounds on t...
متن کاملAdaptive estimation of the copula correlation matrix for semiparametric elliptical copulas
We study the adaptive estimation of copula correlation matrix for the semi-parametric elliptical copula model. In this context, the correlations are connected to Kendall’s tau through a sine function transformation. Hence, a natural estimate for is the plug-in estimator ̂ with Kendall’s tau statistic. We first obtain a sharp bound on the operator norm of ̂ − . Then we study a factor model of , ...
متن کاملOn the Exponentially Weighted Aggregate with the Laplace Prior
In this paper, we study the statistical behaviour of the Exponentially Weighted Aggregate (EWA) in the problem of high-dimensional regression with fixed design. Under the assumption that the underlying regression vector is sparse, it is reasonable to use the Laplace distribution as a prior. The resulting estimator and, specifically, a particular instance of it referred to as the Bayesian lasso,...
متن کامل